RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: Q 9/ y^'O s 9 /I 

Source: 



Date Processed by STIC: J> ~X J~Q I 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21heln@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 




Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 
ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



SERIAL NUMBER: 



1 Wrapped Nucleics 

Wrapped Aminos 



The number/text at the end of each line '"wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 



Jnvalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



3 Misaligned Amino 

Numbering 

4 Non-ASCII 



_ Variable Length 



6 Patentln 2.0 

"bug" 



The numbering under each 5 m amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text 

Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences^) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



7 Skipped Sequences Sequence(s) 

(OLD RULES) 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



8 Skipped Sequences Sequence(s) 

(NEW RULES) 



_ missing. If intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 

_Use of n's or Xaa's Use of n's and/or Xaa's have been detected in the Sequence Listing. 

(NEW RULES) Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



10 V Invalid <213> 
Response 



Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



11 



Use of<220> 



Sequence(s) _ 



missing the <220> "Feature" and associated numeric identifiers and responses. 



12 



_PatentIn 2.0 
"bug" 



Use of <220> to <223> is MANDATORY if <2 1 3> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 
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OIPE 



RAW SEQUENCE LISTING DATE: 05/23/2001 

PATENT APPLICATION: US/09/750 , 98 6A TIME: 12:37:52 

Input Set : A:\B-99982.txt — 
Output Set: C:\CRF3\05232001\l750986A.raw B£ 



ERED 

5" 



3 <110> APPLICANT: Steinbuchel, Alexander £CL <? f > - 

4 Priefert, Horst 

5 Rabenhorst, Jurgen 

7 <120> TITLE OF INVENTION: SYNTHETIC ENZYMES FOR THE PRODUCTION OF CONIFERYT. 

8 ALCOHOL, CONIFERYLALDEHYDE, FERULIC ACID, VANILLIN AND 

9 VANILLIC ACID AND THEIR USE 

11 <130> FILE REFERENCE: Bayer-9998 . 2-HCL 

13 <140> CURRENT APPLICATION NUMBER: 09/750, 986A HftfiS Not COfif»p»y . 

14 <141> CURRENT FILING DATE: 2000-12-28 rA Hiskette N e8C ^ 

16 <150> PRIOR APPLICATION NUMBER: 196 49 655.1 GERMANY COfT© C * eCl 

17 <151> PRIOR FILING DATE: 1996-11-29 

19 <160> NUMBER OF SEQ ID NOS : 45 P' 
21 <170> SOFTWARE: Patentln Ver. 2.1 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 32679 

25 <212> TYPE: DNA 

26 <213> ORGANISM: Pseudomonas sp. 

28 <220> FEATURE: 

29 <221> NAME /KEY : CDS 

30 <222> LOCATION: { 314 6) . . ( 3997 ) 

31 <223> OTHER INFORMATION: gene = "ORFl" 

33 <400> SEQUENCE : 1 

34 gaattcatcc tcatggagca cttctacaag cagcaggcag gccaccctcc ccagaccgat 60 
36 gacgtgcata ttatcgcgat cggcggaacg agctttaaac gctacctgga gctcggaaag 120 _ 
38 ctcctgaaca tcagagttgc cgcaattcga gataacgacg gtgactatca gcagaactgt 180 
40 gtagcgaact acgaaggcta cctgtacgag tcggccaaga ttttcgccgc cccagatcct 240 
42 gaccgaagca ccttcgaaat agggctgtac cgtgacaacc agaaagcctg tgacgatctc 300 
44 tttgttgcgg gtcgcaaaaa actgaccgtg caagagtaca tgctcaaaaa taaagcggat 360 
46 gccgctttcg agctgctgac caagaagtcc gctgaactga tcgccccgaa gtacatacag 420 
48 gaagcgatcg aatggataag agcgtaattt tctccgtcgc aggatccggg aaaaccagcc 480 
50 tgatcatcga gcgtctcagc cttgatcagc gggcattggt catcacttac acggacaaca 540 
52 atcaccggca cctgcgcaac aggatcattc agagattcgg ggtgatccca tccaacatca 600 
54 cgctcatgac gtacttctcg ■ ttcctgcatg ggttctgcta tcggcccttg atgcaattgc 660 
56 agctaggaac acgaggccta aatttcagac gtccgcccaa caggcagtac cccctgaacg 720 
58 atctcaatcg gtatcgcgat ggaagcggca ggctctatca ctgccgcctc gcgaaactgc 780 
60 tggacgttgc gcaggcctta ccggatgtgc gtgcccgcct ggagcgcttt tacgactgcc 840 
62 tgtacgtcga cgaggtacag gatttcgcgg gtcacgactt caacctcctg ctggaggttt 900 
64 cacgggcgaa gatcggcatg acgttcgtcg gtgatttcca ccagcacacc ttcgatacca 960 
66 gccgagacgg agcggtaaac aaaacccttc acgacgatgc cgttcgctac gagaagcgct 1020 
68 ttcgtgatgc cggcatttcg gtggacaagc aaacgttgaa ccgcagctgg cgatgcgcca 1080 
70 aaacggtctg tgacttcatc agcgcaaagc tgaaaattgg cgatggacgc tcacgaggag 1140 
72 cggggcagcc ggatcattag agttgatgac caagagcagg ccaacttgtt gcacgttgac 1200 
74 ccaaccatcg tgaagctgtt tttgagcgaa cactacaagt acggctgcca ctccgaaaac 1260 
76 tggggggcaa gcaaggcatg gatcacttta acgatgtctg cgttgtgatg ggcccgggta 1320 
78 tctggaaaga ctatgtggct gagaggttac accaggccaa cccgcaaacc cgaaacaagc 1380 
80 tgtacgtggc ctgcactagg gcgcggggtg atctgtattt cgtgcctgag aagctcttga 1440 
82 gggccttcaa acagggaaat taggcgataa agctgaaaaa ggattttcaa qtaaagacca 1500 
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84 ctccttcctt actcgatgtc cgcttttggc cgatttctgc cagtcacgac cggcaaagaa 1560 
86 cggccaaaag cggactgatg cggttactaa gcctgcctct tattgaagct tggtgggctt 1620 
88 taagaatgtg gtgcgatcca gcctgatgat gttccgcttt atgcacgcag ccaagcctat 1680 
90 cgaccgccgt ctgcacgttg taaccgacta cgcctgtgcc tttgccgctg gtggccatgg 1740 
92 agcgtgcatc cggatcggtg agtgagactt gcccatccgg tgcttcacgt agctgctgct 1800 
94 ccatctcctt gagcgcctgc atctgctggc ggagtttctc gattttatcc tggaggcggc 1860 
96 tggctttggc ttcggcgaca tcggattgag ttctgtcggc ggtgtccatc gctgccagat 1920 
98 agcggtcgat gattttatca atctggtcca tccgggcgcg cacccgctat ' gatccggagt 1980 

100 C.C.t C.r.nPif P\f rn^t nanncr t a t nt- nnnnf- nn^^n^rrnr^rr f t-onrrfrrf f ■(-/-< = rr d ^ +- +- n 1 n a n 

102 gcgagaacgc ggtcaagctc agcttcgggc gccaagcctt caagatcggc aacggcttcc 2100 

104 tgatcggcga aggccacgtc gaccaaggta acgatgcggg ctactggctg gcccctacct 2160 

106 aggcgttcga caacaccgtc ctagcccaac tggacaccgg caagctgcat gtcgacctgt 2220 

108 tcgacctcca ggcgggcatg gatctggacg tcgccgacat caaggagaaa gtccgggtgc 2280 

110 gcgggggcaa cgtcgagtgg cgcgacgaga cctacggcac ggtagggttc accggcttcc 2340 

112 atacgctgga cgctgacaat ccgctgcgcg acggcatgaa tgtctacgac gtacgcgcat 2400 

114 cgggcagccc gatccgagcc ctgccgcagg tggccctggc ggcggagtac gcctggcagc 2460 

116 gcggcggcga ggcggacaag acgagtgagg cctggtacct acagggcagc tacacctttc 2520 

118 gggatgcccc ctggacgcca gtgctgatgt accgtcacgc ggtcttctcc gacgactacg 2580 

120 actccctgct gtacggctaa gggggcaaca acatgggctg gaaaggagca ttgcgttgaa 2640 

122 acgatgctga agggcgtcac tcttttactg ctgtccgctc acgtcgaaac tgcatgattt 2700 

124 cgggcagcct ttcttctatc cagtcggcca gcacctgaac atgagccgct acttcctggc 2760 

126 caagcggcgt caggctgtac tcgacatgtg ggggaacgac cgggagcgaa tgtcgagcta 2820 

128 tgaaaccgtc tccctccagg ccttgtaggg tctgcgcaag cattcttttc gctgacaccg 2880 

130 ccgattcttc cgacgcaggt cgctgaatcg atggacaccg tccaccaaga tgatcagcac 2940 

132 gagcacgccc agcggcttgt cacgtgcttg agcacgtccc gcgacggcat tcagcactca 3000 

134 gcaattcccg cgccgtgctt gcatggagag actggtaagg gcggccagcg tgagtttcat 3060 

136 ggcactaacc tttatgtatg tacttacttt tagttgctag tagggatatg gtgacgcctt 3120 



138 


catcctacga 


aacaagtgaa gactg 


\atg 


ate 


gee 


ate 


aca 


ggt 


gee 


tec 


gga 


3172 


139 
















Met 


He 


Ala 


He 


Thr 


Gly 


Ala 


Ser 


Gly 




140 
















1 








5 












142 


caa 


ctt 


ggt 


egg 


ttg 


act 


ata 


gag 


gcg 


eta 


ctg 


aag 


cgc 


ctg 


cca 


gca 


3220 


143 


Gin 


Leu 


Gly 


Arg . 


Leu 


Thr 


He 


Glu 


Ala 


Leu 


Leu 


Lys 


Arg 


Leu 


Pro 


Ala 




144 


10 










15 










20 










25 




146 


tec 


gaa 


att 


att 


gec 


etc 


gtc 


egg 


gat 


ccg 


aat 


aag 


gee 


gga 


gac 


ctt 


3268 


147 


Ser 


Glu 


He 


He 


Ala 


Leu 


Val 


Arg 


Asp 


Pro 


Asn 


Lys 


Ala 


Gly 


Asp 


Leu 




148 










30 










35 










40 






150 


acc 


gca 


cgt 


ggc 


ate 


gtg 


gtg 


cgc 


cag 


gee 


gat 


tac 


aac 


egg 


ccg 


gaa 


3316 


151 


Thr 


Ala 


Arg 


Gly 


He 


Val 


Val 


Arg 


Gin 


Ala 


Asp 


Tyr 


Asn 


Arg 


Pro 


Glu 




152 








45 










50 










55 








154 


aca 


etc 


cac 


egg 


gee 


ctg 


att 


ggg 


gtc 


aac 


egg 


ttg 


ctg 


ttg 


att 


tec 


3364 


155 


Thr 


Leu 


His 


Arg 


Ala 


Leu 


He 


Gly 


Val 


Asn 


Arg 


Leu 


Leu 


Leu 


He 


Ser 




156 






60 










65 










70 










158 


tec 


agt 


gag 


gtg 


ggt 


caa 


cga 


act 


gcg 


caa 


cac 


egg 


gca 


gtg 


ate 


gac 


3412 


159 


Ser 


Ser 


Glu 


Val 


Gly 


Gin 


Arg 


Thr 


Ala 


Gin 


His 


Arg 


Ala 


Val 


He 


Asp 




160 




75 










80 










85 












162 


get 


gcg 


aag 


caa 


gaa 


ggt 


ate 


gag 


ttg 


ctg 


get 


tat 


acg 


agt 


ctg 


ctt 


3460 


163 


Ala 


Ala 


Lys 


Gin 


Glu 


Gly 


He 


Glu 


Leu 


Leu 


Ala 


Tyr 


Thr 


Ser 


Leu 


Leu 




164 


90 










95 










100 










105 




166 


cat 


gee 


gat 


aaa 


teg 


gcg 


ctg 


ggc 


eta 


gcg 


act 


gaa 


cac 


cga 


gac 


acg 


3508 
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167 


His 


Ala 


Asp 


Lys 


Ser 


Ala 


Leu 


Gly 


Leu 


Ala 


Thr 


Glu 


His 


Arg 


Asp 


Thr 




168 










110 










115 










120 






172 


gaa 


cag 


qcc 


ctq 


aca 


aaa 


tec 


aat 


att 


cct 


cat 


gt c 


ctg 


tt a 

u i_y 


cgc 


aac 


3556 


173 


Glu 


Gin 


Ala 


Leu 


Thr 


Glu 


Ser 


Gly 


He 


Pro 


His 


Val 


Leu 


Leu 


Arg 


Asn 




174 








125 










130 










135 








176 


ggt 


tgg 


tat 


cac 


qaa 


aac 


tac 


acg 


oca 


aac 


ate 


cca 


gt c 


aca 
y ^y 


ctg 


at t 

y l i_ 


3604 


177 


Gly 


Trp 


Tyr 


His 


Glu 


Asn 


Tyr 


Thr 


Ala 


Gly 


He 


Pro 


Val 


Ala 


Leu 


Val 




178 






140 










145 










150 










180 


cat 


"TJ TJ ~ 


rtf- n 


■h-hrr 

*3 


r- f- rr 


rrrrr' 






pan 




n m r< 
^ y-j w 


■h -h /-T 
*- ^ ^ 


^■h +- 




+■ 


gc u 


3652 


181 


His 


Gly 


Val 


Leu 


Leu 


Gly 


Cys 


Ala 


Gin 


Asp 


Gly 


Leu 


He 


Ala 


Ser 


Ala 




182 




155 










160 










165 












184 


gca 


cgt 


get 


ga c 


tac 


gee 


ga a 


gca 


y 


get 


y L y 


y L y 




3 r 1 r 1 


ggt 


gag 


J / UU 


185 


Ala 


Arg 


Ala 


Asp 


Tyr 


Ala 


Glu 


Ala 


Ala 


Ala 


Val 


Val 


Leu 


Thr 


Gly 


Glu 




186 


170 










175 










180 










185 




188 


aat 


cag 


gca 


oat 


cgc 


gt c 


tac 


craa 


ctg 


gec 


aat 

y y 


gaa 


ccg 


gca 


tat 


p r* rr 


^7 4 R 


189 


Asn 


Gin 


Ala 


Gly 


Arg 


Val 


Tyr 


Glu 


Leu 


Ala 


Gly 


Glu 


Pro 


Ala 


T vr 


Thr 




190 










190 










195 










200 






192 


etc 


acc 


gaa 


ct g 


gca 


get 




at a 

y uy 


aca 


ccg 


caa 


gca 


y y « 


aag 


acc 


rr"h r* 


^7 Q 


193 


Leu 


Thr 


Glu 


Leu 


Ala 


Ala 


Glu 


Val 


Ala 


Pro 


Gin 


Ala 


Gly 


Lys 


Thr 


Val 




194 








205 










210 










215 








196 


gtg 


tat 


teg 


aac 


eta 


tec 


gag 


age 


gat 


tac 


cga 


tct 


gcg 


ttg 


ate 


agt 


3844 


197 


Val 


Tyr 


Ser 


Asn 


Leu 


Ser 


Glu 


Ser 


Asp 


Tyr 


Arg 


Ser 


Ala 


Leu 


He 


Ser 




198 






220 










225 










230 










200 


gcg 


ggc 


ctt 


ccc 


gat 


ggt 


ttt 


gcg 


gca 


ttg 


etc 


gca 


gac 


tct 


gat 


gca 


3892 


201 


Ala 


Gly 


Leu 


Pro 


Asp 


Gly 


Phe 


Ala 


Ala 


Leu 


Leu 


Ala 


Asp 


Ser 


Asp 


Ala 




202 




235 










240 










245 












204 


ggc 


gca 


gec 


aag 


ggg 


tat 


ttg 


ttt 


gat 


tec 


agt 


gga 


gac 


agt 


cgc 


aag 


3940 


205 


Gly 


Ala 


Ala 


Lys 


Gly 


Tyr 


Leu 


Phe 


Asp 


Ser 


Ser 


Gly 


Asp 


Ser 


Arg 


Lys 




206 


250 










255 










260 










265 




208 


ctg 


ate 


ggt 


cgc. 


cca 


acc 


act 


ccg 


atg 


teg 


gaa 


gee 


ate 


gcg 


gca 


gca 


3988 


209 


Leu 


He 


Gly 


Arg 


Pro 


Thr 


Thr 


Pro 


Met 


Ser 


Glu 


Ala 


He 


Ala 


Ala 


Ala 





210 ^ 270 275 280 

212 att ggc cgc taaaactgea ttttcgegae ttgagtgaca cctgggttag 4037 

213 He 'Gly Arg 



215 ataacccagg tgtctcgcac cgctttgggt tagtggtggg caatageggt gtctggtcac 4097 

217 cgcttgcccg gcggcgcgcc cgctattgga tgattctcaa cttcctggtg ccggcgtctt 4157 

219 gttggggccc aaacaggegg geataacgea atgtggcatt tgcactgtcg cgcatgatgg 4217 

221 cttctgctcg agcaccttgc ccgctaatca gcgcgtctac cacagcatga tgctgcatgt 4277 

223 tggcaaaatt gaaceggegg tactcttggg gaggttgeta ccgtcgacgg ccagtgaact 4337 

225 gacagaggca aagggcaggt gttcattccg agecaatget tcacctatgg cagcgttacc 4 397 

227 gctggcatcc acgatagctt gatggaagcg cttgttgatg tcgtggtatt eggegaggtc 4457 

229 gtcttcgctg acataacctt tctcaaatag ggcatcgccc tgggecaage actgeaagag 4517 

231 gatctcttgc gtttcactgg atagccctcg ctcggcagcc tgecttgegg ccagtccttc. 4577 

233 aagtacccct cgaacctcca ccgcgcctgc caggtcattt ggggtcattt gccgcactgc 4637 

235 atagecaegt gcgccttggc gatcagtaac ccttcctgtt ctagcgctcg ■ gaacgcaatg 4697 

237 eggataggtg tgcgccgaca ctcccaggcg cteggcagtg gggatttegg cgatgcgctc 4757 

239 tcctgccggg agttcgccat ccacaatcat tttgcgcagt agattgagta ctcgctgccc 4817 

241 gggcccgctc atttcagect ccgattggat ccagtaatgg tttgagagaa ttttactege 4877 

243 aagggatttc tgggcaatag ccccgctgat tgctggtttt tgtatgtggc gtgegactat 4937 
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24 5 cgcacagaat 
247 atattggatc 
249 tccaaaaaca 
251 gcttgcactc 
253 attgtcttct 
255 cgcggggcac 
257 ggtttggaaa 
259 ttcccttgca 



tggatccacc 
ccataaggtc 
agatcgccat 
cggatgaaat 
atcgggggcc 
cgttgtccct 
tgggctgcga 
tcaaaagcta 



ttggcgcaaa 
aagttcatag 
tgaggaacgc 
cgcagataag 
ggaaggacgt 
gggtttcgtt 
gggcaaaacg 
cgcggtagaa. 



aaaactggag 
ctgattttgg 
gccatgtttc 
ccgctaggcc 
gttgccgcgg 
cgcgacggta 
ctcgcgatgc 
gagcgatacg 



ctacct cat c 
ctttagatgt 
cgaaaaacgc 
gtcagatctg 
tagaggattt 
agctgatttg 
ccgggcagcg 
gctttatctg 



ggtcgtggtt 
ccattgtgga 
ctggtatgtc 
caacgaaaag 
ctgccctcat 
cggctaccac 
cgttcaaggc 
ggtatggcct 



4997 
5057 
5117 
5177 
5237 
5297 
5357 
5417 
5 4 7 7 
5537 
5597 
5657 
5717 
5777 
5837 
5897 
5957 
6017 
6077 
6137 
6197 
6257 
6317 
6377 
6437 
6497 
6557 
6617 
6677 
6737 
6797 
6857 
6917 
6977 
7037 
7097 
7157 
7217 
7277 
7337 
7397 
7457 
7517 
7577 
7637 
7697 
7757 
7817 
7877 



263 gagtgggcct 
265 aacctcatgg 
267 attgacgagg 
269 atggataacg 
271 gacgatgtac 
273 atcgaagtag 
275 gccggcagca 
277 tggggcatgg 
279 ggtcagggca 
281 ctggcctacc 
283 cggcgcgtca 
285 gcgagaagtg 
287 tcattagcct 
289 cccacgtcga 
291 caggggcgca 
293 cgaaggcggt 
295 acctattccc 
297 ttacgccgat 
299 attattgcgc 
301 ttgctgatca 
303 cagcgctgct 
305 tcatggggca 
307 gagagtattt 
309 ttcacagcac 
311 atgcggccgg 
313 ctcgggtggt 
315 caaagaacga 
317 atctctaact 
319 agtaaacagg 
321 cccgctcttt 
323 ttacaagttt 
325 acccaatcag 
327 cctggtgcgg 
329 tgattccgag 
331 ggtgggcgag 
333 cgaatgccgt 
335 tgctactcag 
337 gccgatctac 
339 ctccctggcg 
341 cgtgaccacc 



atggtggcgg 
atctcaccca 
caccggtcag 
tcatggcccc 
cggttgatcg 
gtgtggctca 
tagtggtcga 
ctcgcaactt 
agatttttgc 
cggagcgcca 
ttgatcggat 
catcatgatt 
tgagtttgtc 
tgtgcatctt 
gagccattac 
gcacgagaat 
attggaagag 
tctgtgtatg 
gcgttcgacc 
cgt acgtt tc 
agcggccgag 
tgtgcttgat 
cgccgcggcg 
cggacaagtg 
aattatcgtt 
agacggagag 
ccagttcacc 
catccccgtg 
ctcaaccgtt 
tcatgcacca 
attcagaagg 
ccattgcaga 
gtcgtggcca 
ggcgttttcc 
ggcgtcacct 
gagtgcaagt 
ggcaagggtc 
cactacatgg 
aagattccca 
ggcattggcg 



tctctaccac 
tgagacctat 
tactcgtgtc 
tccgttctgg 
ctggcagatc 
tgcgggcaaa 
cttcatcacg 
ccgtccgcag 
cgaggacctg 
gttgctcaag 
tctcgcagct 
gaggtaatca 
cgggctgacg 
cctggcggcc 
tgcatcggtg 
cttcgcgtcg 
ggggtggagc 
get caagaat 
gaecgagegg 
cactttgaca 
gecgaaggta 
accgegaagg 
ccgaatgtga 
cttcaggtcc 
cccgtttctt 
cctgatcatc 
ccctgttgct 
tccggtcccc 
tttagcggga 
agecattcat 
aaatcggaat 
tegtcgaagt 
ccggcgtttg 
cctgcatcct 
cgctggcggt 
tcttcaagtc 
tgatgccgga 
gctgctcgac 
agaatgegee 
cggtgctgaa 



atcgcttgtg 
gtgeatgect 
gagggegaca 
cgtgctgcgc 
tgecgatteg 
ggcggatatg 
ccggagagtg 
ggcaeggage 
gaeatgetgg 
ctgaatatcg 
gaacaagagg 
ttteggegat 
gtggcttgct 
tgatteggea 
ttctgaagga 
ggatgcgcgt 
ggagtctget 
tagcagcacg 
cgttcgttga 
atggcccgga 
cccaccttta 
agcagggctg 
gtgetgaega 
ccgcggatca 
gtgagcaggg 
gtgacttctt 
cgcgagccaa 
tgctttggtg 
taaccattct 
agtcaccagc 
gatcaaatcc 
ggacgtggct 
ccacaccgat 
tggtcacgaa 
cggcgaccac 
cggcaagacc 
cggcacctcc 
cttctccgag 
gctggagaaa 
cactgccaag 



attaccgcct 
ccagcatcgg 
ccgtgattac 
ttcgtggcaa 
ctcctccgag 
acgcgccggc 
atacctcgat 
tgactgaaac 
agcagcagca 
atgeeggegg 
ccgcagacgc 
gcgcttggtt 
tccgcctgtc 
gtactcgctc 
cccggcgtct 
gcaaattagc 
gttcgeggge 
cgagcaagat 
atggcttaag 
t cagcaaaaa 
tgtctgtggg 
ggctgacaat 
tggcagtttc 
aacggtctcc 
catctgeggt 
cctcacggat 
gagegectgt 
eggeggactg 
tgaggatgaa 
tgcttctacg 
cgcgccgctg 
ccgcccaagg 
gcctacaccc 
ggeggeggea 
gtgatcccgc 
aacctgtgcc 
cgcttcagct 
tacaccgtgc 
gtctgcctgc 
gtggaggagg 



gatgatcgac 
tcaaaaggaa 
cagccggtac 
cggcttggcc 
teaegtactg 
ggaatacaag 
ttggtacttc 
cattcgtgtt 
gcgcaatctg 
ggttcagtca 
agegctgate 
gctcaggaca 
gaggceggeg 
tggaatcaac 
cgtggtggtt 
gagecgagga 
gggattggca 
ttcgagttgc 
gtttgegact 
ctgaatgccg 
cccggcgggt 
cgactgcatc 
gaggtgcgga 
caggtgeteg 
acttgeatea 
geggagaagg 
ttggtcttgg 
tgcgcgggta 
ggagggttat 
tgctgctgcg 
tggcgttcgc 
ccggtgaagt 
tgtccggcgc 
ttgtcgaagc 
tctacacggc 
agaaagtgcg 
acaaeggtea 
tgccggaaat 
tgggctgegg 
gtgctaccgt 
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RAW SEQUENCE LISTING DATE: 05/23/2001 

PATENT APPLICATION: US/09/750 , 986A TIME: 12:37:52 

Input Set : A:\B-99982.txt 

Output Set: C:\CRF3\05232001\l750986A.raw 

343 ggccatcttc ggcctgggcg gcatcggctt ggcggcgatc atcggcgcga agatggccaa 7937 

345 ggcctcgcgc atcatcgcca tcgacatcaa tccgtccaag ttcgatgtgg ctcgcgagct 7997 

347 gggcgccact gacttcgtca atccgaacga tcacgcgaag ccgatccagg atgtcatcgt 8057 

349 cgagatgact gatggcggtg tggactacag cttcgagtgc atcggcaacg ttcgactcat 8117 

351 gcgcgcagca ctcgagtgct gccacaaggg ctggggcgaa tccgtgatca tcggcgtggc 8177 

353 gccggcgggg gccgaaatca acacccgtcc gttccacctg gtgaccggtc gcgtctggcg 8237 

355 gggttcggcg ttcggtggcg taaagggccg caccgaactg ccgagctacg tggagaaggc 8297 

357 acagcagggc gagatcccgc tggacacctt catcactcac accatgggcc tggacgacat 8357 

^ S 9 n ^ p n ^ nnnr- n M-rrrarntrfa +- n- rr a r- n 3 ^ n rr rrn^nortp^f ^ ^^^.-i-^-h^r-h-h^ +- 4- ^. -> -t- +- ~ ^ o a 1 n 
--j ~2 — — ^.v-^-^^wwwv-j^ v,^^^*w^v^^^^ yuuyu^^auw ^yot^Ly^ty u u a a l tyay u i _l / 

361 tcgctagtga agtggggtga ggaaattgga ttaggaggcg gatggttcct gccgcttaac 8477 
363 caccttgtcc cagcttctgg ctgagatttc caagattcgg tgaaatttgc catgccgcaa 8537 
365 actcttgctg gacggttgag tctgttatcc ggcaccgacg aattaaccct gcttcttcgg 8597 
367 ggtggtcggg gcattgagcg tgaagccttg cgggtcgatg ttcaaggtga actggcgctg 8657 
369 acgcctcacc cggcggcgct tggctctgcg ttgacccatc cgacaattac tacggattac 8717 
371 gccgaggccc tgcttgagtt gatcactcgg ccggcaaccg attgtgcgca agccttggct 8777 
373 gagctggagg agcttcaccg tttcgttcat tcgagacttg agggggagta tctctggaat 8837 
375 ctgtccatgc ctggcagatt gccggttgat gagcaaatcc cgattgcttg gtatggacca 8897 
377 tcaaatccag gcatgttgcg ccacgtttat cgccgtggcc tagctctgcg ttatggcaag 8957 
379 cgaatgcaat gcatcgcagg gattcactac aactactcac tgccgccaga gcttttcgct 9017 
381 gtcctgacca aggcagaggt cgggtctccc aagttactgg agcgccagtc agcagcttac 9077 
383 atgcgccaaa ttcgcaacct tcggcaatac ggttggttgc tggcctactt gttcggcgct 9137 ■ 
385 tcccccgcca tctgcaagag cttcttgggg ggcgagagag atgagctagc tcgcatgggg 9197 
387 ggcgatacgc tttacatgcc ctatgcaacc agcttgcgca tgagtgacat cgggtaccgc 9257 
389 aaccgtgcca tggatgatct atctcccagc ctgaatgatc tgggtgccta tattcgcgat 9317 
391 atttgccgtg ctcttcacac tcccgatgcc cagtaccagg cgctgggtgt gtttgcacag 9377 
393 ggcgagtggc ggcagttaaa cgccaatcta ttgcagttgg atagtgagta ctacgcactg 9437 
395 gcgcgaccga agtcagcgcc cgagcggggg gagcgaaacc tggatgctct cgctaggcgt 9497 
397 ggagtccagt atgtggagct gcgcgcactg gatctcgatc cattctcccc gttaggcatt 9557 
399 ggcctgacct gcgccaagtt cctcgatggc tttttgcttt tctgcttgtt gtctgaggcg 9617 
401 ccggttgatg atcgaaatgc ccagcgttca agaccgggaa aatctgagcc tggccggcaa 9677 
403 gtacgggcgt cacctggctt aaagctgcat cggaatggtc agtccattct cctcaaggat 9737 
405 tgggcgcagg aagtgttgac ggaggttcag gcctgtgtgg aattgctcga cagtgcaaat 9797 
407 gggggctcat ctcacgcatt ggcttggtca gcacaggagg aaaaggtgct taatccggat 9857 
409 tgtgcgccat cagctcaggt gctcgcagag atacacagac acggtgggag cttcacggca 9917 
411 tttggtcgcc aattagctat cgaccatgca aaacacttca gtgcctcctc gcttgaggct 9977 
413 ggcgtagcca aagcgcttga cctccaggcg acgtcgtctc tgcgcgagca gcatcaattg 10037 
415 gaggccaacg accgtgcgcc attttctgac taccttcagc aattctccct ggctttcggt 10097 
417 caatccgtcg gcgcctctcg tgcgcccaac cctaccgcgc acctcatcga tctgacccct 10157 
419 cctgtctaag gttgtcgtgg gagcagatcc gtgggccgag cttcctccag ggcctggccg 10217 
421 cagcgatcca gttgctaggt ccctatgctc ttgcataggg taaaaattag ttattgtgtt 10277 
423 taacgaaacg tctggcatac tggctttagg cacgagcttc cacgccgaag ttgagagcgt 10337 
425 catgaacgat ttttcgtgtg gagagacgat gcccgatgcg gtcgacgagg ttcaggtcct 10397 
427 aatggcagtg ccggcggcta aacggaacgt gccgtatttt gaggcttgga gcgtggtgaa 10457 
429 gcagcttggc tgctccctgg gcctgtcagg atcacgctgt gtcggcagtg acacttcaaa 10517 
431 acaagaaggg cattaagatg atgaatgtta attataaggc tgtcggggcg agcctactcc 10577 
433 tcgccttcat ctctcaggga gcttgggcag agagccccgc agcctctggc aatacccctg 10637 
435 acatttatcg aaagacctgc acctactgcc atgagcctac tgtcaacaat ggccgggtca 10697 
437 ttgcccgaag cctcgggccg actctgcgag ggcgccagat ccctccacag tacacggagt 10757 
439 acatggtgcg tcatggacgc ggggcaatgc ctgcattctc tgaagcagaa gtgcctccgg 10817 
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<210> 3 y y 

<211> 1065 \%/3s> oracu^^j a 

<212> DNA ^ ___ir-> J J y 

^213> not required under old ruley ^ / » c /j icZcL.ro /~y /~ <2 Sy> & SO , ess? r 

<220> s-<j/c>S g ^6* 

<221> CDS y , 

<222> (1)..(1062) 

<223> product = "Vanillinsaeure-O-Demethylase" / gene = ' wr / 

"vanA" ^/tf 

<400> 3 vl/ /~>^ 

atg ttt ccg aaa aac gec tgg tat gtc get tgc act ccg gat gaa ate 48 <SM r^<L CLrror 
Met Phe Pro Lys Asn Ala Trp Tyr Val Ala Cys Thr Pro Asp Giu He 

i 5 10 15 Senary Jr/<*<?T, 

re*"'**' 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 
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VERIFICATION SUMMARY DATE: 05/23/2001 

PATENT APPLICATION: US/09/750 , 986A TIME: 12:37:53 

Input Set : A:\B-99982.txt 

Output Set: C:\CRF3\05232001\l750986A.raw 

L:4934 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:44 
L:4937 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:44 
L:4955 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:45 
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